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Art Unit: 2626 

Claim Rejections - 35 USC § 102 
Allowable Subject Matter 

Claims 8-1 1 are allowed. 

The following is a statement of reasons for the indication of allowable subject 
matter: the claims are allowed because the prior arts do not teach including the echo 
canceller as recited in the claims. 

Claims 6 and 20 are objected to as being dependent upon a rejected base claim, 
but would be allowable if rewritten in independent form including all of the limitations of 
the base claim and any intervening claims. 

Claim Rejections - 35 USC § 102 

The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

Claims 1, 3-5, 7, 12-15, 17-19 and 21-23 are rejected under 35 U.S.C. 102(b) as 
being anticipated by Berestesky (6,321,194). 

As to claims 1 , 3-5 and 7, Berestesky teaches a method for voice detection in 
audio signals, comprising the steps of: 

Receiving an input audio frame; 

Determining whether the audio frame comprises human voice using fuzzy logic 
algorithm from values of parameter (array of elements) extracted from the audio signals 
(abstract; Col. 2, lines 10-15; Fig.4). 
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According to Berestesky the method of using fuzzy logic technique to analyze the 
characteristics and determine whether the input is voice disclosed in detail 



"When the switch 60 closes, output of spectrum 5 accumulator 58 is provided to feature extraction 
blocks 62, 64, 66 which calculate values based on elements in the array Y.sub.s. A first block 62 
calculates feature L1; a ratio of a sum of lower-frequency spectrum components to a sum of 
higher-frequency spectrum components in Eqn. 2: ##EQU3## 

7 

If the audio signal has a frequency spectrum that spans the range [250, 2500] Hz of frequencies, 
then L1 would be on the order of 1. 

8 

A second block 64 calculates feature L2, a ratio of a maximum value (MAX) of the lower- 
frequency elements in the 15 array to a sum of all other lower-frequency elements in the array: 
##EQU4## 

9 

L2 is a measure of a lower-frequency spectrum shape in the audio signal. For example, if the 
audio signal were a tone with a single frequency component of 480 Hz, then L2 would be 
relatively large since the maximum value (MAX) would be the value of Y.sub.s at a frequency of 
480 Hz and all other frequency components would be much smaller than the maximum value. If, 
on the other hand, the audio signal corresponded to noise, then L2 would be relatively small since 
the maximum value (MAX) is about the same size as all other frequency components in that 
range. 
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10 

A third block 66 calculates feature L3, a duration T of the word: 
L3=T (5) 

11 

L3 is a measure of the length of the word. 

12 

L1, L2, and L3 are used as input values for corresponding fuzzy set blocks A 68, B 70, and 
C 72. Each fuzzy set block output f.sub.i (L), where i .di-elect cons. [A,B,C] and L .di-elect 
cons. [L1,L2,L3], represents a degree of membership in the fuzzy set for a particular value 
of the input feature L. The degree of membership f.sub.i (L) is a value (ranging from 0 to 1) 
of a membership function f.sub.i at point L. Degree of membership f.sub.i (L) shows how 
much the value of the feature (L) is compatible with the proposition that the input signal 16 
represents human speech. FIG. 5 shows an example of a generalized membership function f 80 
as a function of the feature L given in arbitrary units. For a value of L equal to l.sub.1 (at point 82), 
the fuzzy set outputs a value of 0.0 which indicates that the input signal 16 does not represent 
human speech. Similarly, for L equal to l.sub.2 (at point 84), the fuzzy set outputs a value of 0.16 
which indicates that the input signal 16 almost assuredly does not represent human speech. In 
contrast, for L equal to l.sub.3 (at point 86), the fuzzy set outputs a value of 1.0 which indicates 
that the input signal 16 represents human speech." (CoU-Col.5, line 40) 

Claims 12-15, 17-19 and 21-23 are analogous to the claims that are addressed 
above and are rejected by Berestesky for the foregoing reasons. 



In Addition to the rejection above by Berestesky, Claims 12, 22 and 23 are 
rejected under 35 U.S.C. 102(b) as being anticipated by Hamilton (5,450,484). 
As to claims 12, 22 and 23, Hamilton teaches a voice detector comprising: 
Energy estimating means; and 

Voice classifier to classify voice for a frame of an input signal according to the 
energy level as well as other signal characteristics values including the frequency 
spectrum (abstract; Col. 6, line 15-Col.7, line 20). 

According to Hamilton 
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"the present invention shown in FIG. 2 for use in analyzing analog signal 100 which is transmitted 
over the public switched telephone network (PSTN) and which has a 4000 Hz bandwidth, analog 
signal 100 is sampled, in accordance with the Nyquist criterion, at least 8000 times/sec and the 
predetermined number of samples or values per frame is chosen to be 128. Further, in the 
preferred embodiment, in order to increase temporal resolution, a frame of 128 values which is 
input to DSP 65 for Fourier analysis is comprised as follows. The "present" frame comprises the 
last 32 samples or values from the previous frame and the next or "new" 96 samples or values 
which have been obtained from input signal 100. As a result, the "next" frame to be Fourier 
analyzed by the FFT after the "present" frame comprises the 32 "old" samples or values from the 
"present" frame and the next 96 samples or values obtained from input signal 100. Then, prior to 
calculating the FFT, each sample or value S.sub.n (where n=0, . . . , 127) is multiplied by a 
windowing function, the values of which windowing function have been previously stored in 
memory. Various windowing functions which are suitable for such use are well known to those of 
ordinary skill in the art and are advantageous in that their use reduces anomalous spectral 
components due to the finite frame length of 128 samples. 

20 

As a result of the above, when DSP 65 of FIG. 2 is embodied in a Motorola 56000DSP and 128 
samples are used to perform a Fast Fourier Transform (FFT), a 128 bin frequency spectrum for 
the input signal is produced wherein the frequency bins are 62.5 Hz wide. Each frequency bin in 
the frequency spectrum has a bin index denoted by n. However, because the signal is real, only 
the first 64 bins are of interest since the last 64 bins are identical to the first 64 bins. The real and 
imaginary coefficients determined by the FFT for each frequency bin are squared and summed to 
provide a bin energy e(n) for each frequency bin in the frequency spectrum and, in addition, 
the energies for each bin are summed to provide the total energy etot for the frame. Next, a 
predetermined number of energy maxima in the frequency spectrum of the frame are determined. 
An energy maximum is defined as the occurrence of a bin in the frequency spectrum of a frame 
which has more energy than its adjacent sidebins and, in accordance with a preferred 
embodiment of the present invention, the only energy maxima determined are the three largest in 
the spectrum. DSP 65 determines whether a third spectral peak exists in the frame; if so, DSP 65 
sets flag ptotflg=1 and determines the signal-to-noise ratio (SNR=(E.sub.1 +E.sub.2)/E.sub.3 
where E.sub.n is the energy of the nth peak). Then, DSP 65 transmits the total energy of the 
frame, the frequency and energy of the two largest energy peaks, SNR, and flag ptotflg to 
microprocessor 50 for analysis. 

21 

Microprocessor 50 analyzes the output from DSP 65 to detect whether a telephone signal 
has been produced by a voice. In particular, embodiments of the present invention detect 
the initial presence of a voice at the beginning of a telephone call and quickly and 
accurately detect a voice -normally within 100 ms of inception-while avoiding false 
detection during ringback or other telephone network tones and signals. As will be 
described below, the detection decision is based on energy, frequency and signal-to-noise 
characteristics of the input signal. Then, microprocessor 50 characterizes the window as either 
having been produced by a voice or not and all appropriate counters, variables, and flags are 
reset and the loop of collecting frames for the next window is restarted from the beginning. 
Microprocessor 50 then transmits the window characterization information to host computer 30. " 

Response to Arguments 

Applicant's arguments with respect to the claims have been considered but are 



moot in view of the new ground(s) of rejection. 
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Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Daniel D. Abebe whose telephone number is 571-272- 
7615. The examiner can normally be reached on monday-friday. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David Hudspeth can be reached on 571-272-7843. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
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